St. Louis County
- Oceania > Australia > South Australia > Adelaide (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Minnesota > St. Louis County > Duluth (0.04)
- (2 more...)
- Overview (0.68)
- Research Report (0.46)
AI Deepfakes Are Impersonating Pastors to Try to Scam Their Congregations
Religious communities around the US are being hit with AI-generated depictions of their leaders sharing incendiary sermons and asking for donations. Father Mike Schmitz, a Catholic priest and podcaster, addressed his congregation of more than 1.2 million YouTube subscribers in November with an unusual kind of homily. You couldn't always trust the words coming out of his mouth, Schmitz said, because sometimes they weren't really his words--or his mouth. Schmitz had become the target of AI-generated impersonation scams. "You're being watched by a demonic human," said the fake Schmitz in one video that the real Schmitz, wearing an L.L. Bean jacket over his clerical suit, included in his public service announcement as an example.
- North America > United States > California (0.15)
- Asia > China (0.05)
- North America > United States > Texas > Dallas County > Dallas (0.04)
- (9 more...)
- Information Technology > Services (1.00)
- Information Technology > Security & Privacy (1.00)
- Media (0.95)
From Memorization to Reasoning in the Spectrum of Loss Curvature
Merullo, Jack, Vatsavaya, Srihita, Bushnaq, Lucius, Lewis, Owen
We characterize how memorization is represented in transformer models and show that it can be disentangled in the weights of both language models (LMs) and vision transformers (ViTs) using a decomposition based on loss landscape curvature. This insight builds on prior theoretical and empirical work showing that the curvature for memorized training points is much sharper than for non-memorized ones, meaning that ordering weight components from high to low curvature can reveal the distinction without explicit labels. This motivates a weight editing procedure that suppresses recitation of untargeted memorized data more effectively than a recent unlearning method (BalancedSubnet), while maintaining lower perplexity. Since the curvature basis has a natural interpretation in terms of shared structure in model weights, we extensively analyze the effect of the editing procedure on downstream tasks in LMs, and find that fact retrieval and arithmetic are specifically and consistently degraded, even though open-book fact retrieval and general logical reasoning are preserved. We posit that these tasks rely heavily on specialized directions in weight space rather than general-purpose mechanisms, regardless of whether the individual datapoints are memorized. We support this by showing a correspondence between the activation strength of task data on the low-curvature components we edit out and the drop in task performance after the edit. Our work enhances the understanding of memorization in neural networks, with practical applications toward removing it, and provides evidence for idiosyncratic, narrowly used structures involved in solving tasks like math and fact retrieval.
- Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Minnesota > St. Louis County > Duluth (0.04)
- (3 more...)
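The curvature-based ordering described in the abstract above can be illustrated with a toy sketch: estimate per-direction loss curvature by finite differences, rank weight components by it, and zero out one end of the ranking. The diagonal quadratic model, the finite-difference estimator, and the choice to zero the sharpest directions are illustrative assumptions, not the paper's actual decomposition or editing procedure (which end of the curvature spectrum to remove is the paper's empirical question).

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setting: each weight direction has a known loss curvature; a few
# "sharp" directions stand in for memorization, the flat ones for shared
# structure. These curvature values are made up for illustration.
d = 8
curvatures = np.array([50.0, 40.0, 1.0, 0.8, 0.5, 0.4, 0.2, 0.1])
w = rng.normal(size=d)

def loss(w):
    # Sum of independent per-direction quadratics.
    return 0.5 * np.sum(curvatures * w ** 2)

# Estimate per-direction curvature with a central second difference.
eps = 1e-3
est = np.empty(d)
for i in range(d):
    e = np.zeros(d)
    e[i] = eps
    est[i] = (loss(w + e) - 2 * loss(w) + loss(w - e)) / eps ** 2

# Edit: zero the k components at the sharp end of the curvature ranking.
k = 2
sharpest = np.argsort(est)[-k:]
w_edited = w.copy()
w_edited[sharpest] = 0.0

print(np.round(est, 2))           # recovers the per-direction curvatures
print(sorted(sharpest.tolist()))  # indices of the sharpest directions
```

For a quadratic loss the second difference is exact up to floating-point error, so the ranking matches the true curvatures; real models need Hessian-vector products or similar machinery instead.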
Overspecified Mixture Discriminant Analysis: Exponential Convergence, Statistical Guarantees, and Remote Sensing Applications
Bolatov, Arman, Legg, Alan, Melnykov, Igor, Nurlanuly, Amantay, Tezekbayev, Maxat, Assylbekov, Zhenisbek
This study explores the classification error of Mixture Discriminant Analysis (MDA) in scenarios where the number of mixture components exceeds those present in the actual data distribution, a condition known as overspecification. We use a two-component Gaussian mixture model within each class to fit data generated from a single Gaussian, analyzing both the algorithmic convergence of the Expectation-Maximization (EM) algorithm and the statistical classification error. We demonstrate that, with suitable initialization, the EM algorithm converges exponentially fast to the Bayes risk at the population level. Furthermore, we extend our results to finite samples, showing that the classification error converges to the Bayes risk at a rate of $n^{-1/2}$ under mild conditions on the initial parameter estimates and sample size. This work provides a rigorous theoretical framework for understanding the performance of overspecified MDA, which is often used empirically in complex data settings, such as image and text classification. To validate our theory, we conduct experiments on remote sensing datasets.
- Asia > Middle East > Jordan (0.04)
- Oceania > Australia (0.04)
- North America > United States > Minnesota > St. Louis County > Duluth (0.04)
- (9 more...)
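The overspecified setting analyzed above can be reproduced in a few lines: fit a two-component, unit-variance Gaussian mixture by EM to data drawn from a single Gaussian. The 1-D setup, fixed equal mixing weights, and known variances are simplifying assumptions for illustration; the paper analyzes the full MDA classifier and its finite-sample rates.

```python
import numpy as np

rng = np.random.default_rng(1)

# Data from a single Gaussian N(mu_true, 1): the two-component mixture
# below is therefore overspecified.
mu_true = 2.0
x = rng.normal(mu_true, 1.0, size=5000)

# Two-component mixture with unit variances and equal weights; EM fits
# only the two component means, initialized to bracket the truth
# (a "suitable initialization" in the sense of the abstract).
mu = np.array([1.0, 3.0])
for _ in range(200):
    # E-step: responsibilities under unit-variance Gaussians.
    logp = -0.5 * (x[:, None] - mu[None, :]) ** 2
    r = np.exp(logp - logp.max(axis=1, keepdims=True))
    r /= r.sum(axis=1, keepdims=True)
    # M-step: responsibility-weighted component means.
    mu = (r * x[:, None]).sum(axis=0) / r.sum(axis=0)

# The fitted mixture mean tracks the true mean, so the plug-in
# classifier stays close to the Bayes rule despite overspecification.
mix_mean = mu.mean()
print(np.round(mix_mean, 2))
```

The interesting behavior the paper quantifies is how fast this happens and what it costs in classification error; this sketch only shows that the overspecified fit remains well-behaved.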
- Oceania > Australia > South Australia > Adelaide (0.04)
- North America > United States > Minnesota > St. Louis County > Duluth (0.04)
- North America > United States > Minnesota > Saint Louis County > Duluth (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Overview (0.68)
- Research Report (0.46)
Safety Pretraining: Toward the Next Generation of Safe AI
Maini, Pratyush, Goyal, Sachin, Sam, Dylan, Robey, Alex, Savani, Yash, Jiang, Yiding, Zou, Andy, Fredrikson, Matt, Lipton, Zachary C., Kolter, J. Zico
As large language models (LLMs) are increasingly deployed in high-stakes settings, the risk of generating harmful or toxic content remains a central challenge. Post-hoc alignment methods are brittle: once unsafe patterns are learned during pretraining, they are hard to remove. In this work, we present a data-centric pretraining framework that builds safety into the model from the start. Our framework consists of four key steps: (i) Safety Filtering: building a safety classifier to sort web data into safe and unsafe categories; (ii) Safety Rephrasing: recontextualizing unsafe web data into safer narratives; (iii) Native Refusal: developing the RefuseWeb and Moral Education pretraining datasets, which actively teach the model to refuse unsafe content and convey the moral reasoning behind refusal; and (iv) Harmfulness-Tag annotated pretraining: flagging unsafe content during pretraining with a special token and using it to steer the model away from unsafe generations at inference. Our safety-pretrained models reduce attack success rates from 38.8% to 8.4% on standard LLM safety benchmarks with no performance degradation on general tasks.
- North America > United States > Minnesota > St. Louis County > Duluth (0.04)
- North America > United States > Minnesota > Saint Louis County > Duluth (0.04)
- Asia > Thailand > Bangkok > Bangkok (0.04)
- Instructional Material (0.92)
- Research Report > New Finding (0.46)
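Step (iv) of the framework above, harmfulness-tag annotated pretraining, amounts to a data-preparation pass: documents a safety classifier flags are prefixed with a special token so the model can condition on it. The sketch below uses a hypothetical token name and a trivial keyword stand-in for the learned classifier; neither is the paper's actual artifact.

```python
# Hypothetical special-token name, not the paper's actual token.
HARM_TOKEN = "<|harmful|>"

def toy_safety_classifier(text: str) -> bool:
    # Stand-in for the learned safety classifier of step (i):
    # a crude keyword check, purely for illustration.
    unsafe_markers = {"exploit", "weapon"}
    return any(m in text.lower() for m in unsafe_markers)

def tag_for_pretraining(docs):
    # Prefix flagged documents with the harmfulness tag; leave the
    # rest untouched. The tagged corpus is what pretraining consumes.
    out = []
    for d in docs:
        out.append(HARM_TOKEN + " " + d if toy_safety_classifier(d) else d)
    return out

docs = ["How to bake bread", "How to build a weapon"]
print(tag_for_pretraining(docs))
# -> ['How to bake bread', '<|harmful|> How to build a weapon']
```

At inference, generation is steered away from continuations that the model associates with the tag; that decoding-side logic is not shown here.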
Geometric Learning Dynamics
We present a unified geometric framework for modeling learning dynamics in physical, biological, and machine learning systems. The theory reveals three fundamental regimes, each emerging from the power-law relationship $g \propto \kappa^\alpha$ between the metric tensor $g$ in the space of trainable variables and the noise covariance matrix $\kappa$. The quantum regime corresponds to $\alpha = 1$ and describes Schrödinger-like dynamics that emerges from a discrete shift symmetry. The efficient learning regime corresponds to $\alpha = \tfrac{1}{2}$ and describes very fast machine learning algorithms. The equilibration regime corresponds to $\alpha = 0$ and describes classical models of biological evolution. We argue that the emergence of the intermediate regime $\alpha = \tfrac{1}{2}$ is a key mechanism underlying the emergence of biological complexity.
- North America > United States > New York (0.04)
- North America > United States > Minnesota > St. Louis County > Duluth (0.04)
- North America > United States > Minnesota > Saint Louis County > Duluth (0.04)
- (2 more...)
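The three regimes named in the abstract above can be collected into a single displayed relation, restating the abstract in its own notation:

```latex
g \propto \kappa^{\alpha}, \qquad
\begin{cases}
\alpha = 1, & \text{quantum regime: Schr\"odinger-like dynamics,}\\
\alpha = \tfrac{1}{2}, & \text{efficient learning regime: fast learning algorithms,}\\
\alpha = 0, & \text{equilibration regime: classical biological evolution.}
\end{cases}
```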
Duluth at SemEval-2025 Task 7: TF-IDF with Optimized Vector Dimensions for Multilingual Fact-Checked Claim Retrieval
Syed, Shujauddin, Pedersen, Ted
This paper presents the Duluth approach to SemEval-2025 Task 7 on Multilingual and Crosslingual Fact-Checked Claim Retrieval. We implemented a TF-IDF-based retrieval system and experimented with vector dimensions and tokenization strategies. Our best-performing configuration used word-level tokenization with a vocabulary size of 15,000 features, achieving an average success@10 score of 0.78 on the development set and 0.69 on the test set across ten languages. Our system showed stronger performance on higher-resource languages but still lagged significantly behind the top-ranked system, which achieved an average success@10 of 0.96. Our findings suggest that although advanced neural architectures increasingly dominate multilingual retrieval tasks, properly optimized traditional methods like TF-IDF remain competitive baselines, especially in compute-limited scenarios.
- North America > United States > Minnesota > St. Louis County > Duluth (0.14)
- North America > United States > Minnesota > Saint Louis County > Duluth (0.14)
- Europe > Austria > Vienna (0.14)
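The pipeline described above (word-level TF-IDF with a capped vocabulary, ranked by cosine similarity, scored with success@k) can be sketched end-to-end in plain Python. The toy corpus, the whitespace tokenizer, and the IDF formula are illustrative assumptions, not the Duluth system's exact configuration.

```python
import math
from collections import Counter

def tokenize(text):
    # Word-level tokenization, as in the described best configuration.
    return text.lower().split()

def build_vocab(docs, max_features=15000):
    # Keep the most frequent tokens up to the vocabulary cap.
    counts = Counter(t for d in docs for t in tokenize(d))
    return {w: i for i, (w, _) in enumerate(counts.most_common(max_features))}

def tfidf_vector(text, vocab, idf):
    # Sparse L2-normalized TF-IDF vector as a {feature: weight} dict.
    tf = Counter(t for t in tokenize(text) if t in vocab)
    vec = {vocab[w]: c * idf[w] for w, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {i: v / norm for i, v in vec.items()}

def cosine(a, b):
    return sum(v * b.get(i, 0.0) for i, v in a.items())

# Toy collection of fact-checked claims to retrieve against.
fact_checks = ["the earth is round",
               "vaccines do not cause autism",
               "the moon landing happened in 1969"]
vocab = build_vocab(fact_checks)
n = len(fact_checks)
df = Counter(w for d in fact_checks for w in set(tokenize(d)))
idf = {w: math.log(n / df[w]) + 1.0 for w in vocab}
index = [tfidf_vector(d, vocab, idf) for d in fact_checks]

def retrieve(claim, k=10):
    q = tfidf_vector(claim, vocab, idf)
    ranked = sorted(range(n), key=lambda i: cosine(q, index[i]), reverse=True)
    return ranked[:k]

def success_at_k(claim, gold_idx, k=10):
    # 1.0 if the gold fact-check appears in the top k, else 0.0.
    return 1.0 if gold_idx in retrieve(claim, k) else 0.0

print(retrieve("did the moon landing really happen", k=1))  # -> [2]
```

Averaging `success_at_k` over a labeled development set of claim/fact-check pairs yields the success@10 figures quoted above.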
Molecular Learning Dynamics
Gusev, Yaroslav, Vanchurin, Vitaly
We apply the physics-learning duality to molecular systems by complementing the physical description of interacting particles with a dual learning description, where each particle is modeled as an agent minimizing a loss function. In the traditional physics framework, the equations of motion are derived from the Lagrangian function, while in the learning framework, the same equations emerge from learning dynamics driven by the agent loss function. The loss function depends on scalar quantities that describe invariant properties of all other agents or particles. To demonstrate this approach, we first infer the loss functions of oxygen and hydrogen directly from a dataset generated by the CP2K physics-based simulation of water molecules. We then employ the loss functions to develop a learning-based simulation of water molecules, which achieves comparable accuracy while being significantly more computationally efficient than standard physics-based simulations.
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- North America > United States > Minnesota > St. Louis County > Duluth (0.04)
- North America > United States > Minnesota > Saint Louis County > Duluth (0.04)
- (4 more...)
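The learning-side description above can be sketched with a toy system: each particle is an agent performing gradient descent on its own loss, and the loss depends only on invariants (here, pairwise distances). The harmonic-pair loss and 2-D setup are illustrative assumptions, not the loss functions the authors infer from CP2K water data.

```python
import numpy as np

def agent_loss(pos, i, r0=1.0, k=1.0):
    # Loss for agent i: harmonic penalty on its distances to all other
    # particles, with rest length r0. Distances are the invariants.
    diffs = np.delete(pos, i, axis=0) - pos[i]
    dists = np.linalg.norm(diffs, axis=1)
    return 0.5 * k * np.sum((dists - r0) ** 2)

def step(pos, lr=0.05, eps=1e-6):
    # One learning step: every agent lowers its own loss by numerical
    # gradient descent on its own coordinates.
    new = pos.copy()
    for i in range(len(pos)):
        grad = np.zeros(pos.shape[1])
        for d in range(pos.shape[1]):
            e = np.zeros_like(pos)
            e[i, d] = eps
            grad[d] = (agent_loss(pos + e, i) - agent_loss(pos - e, i)) / (2 * eps)
        new[i] = pos[i] - lr * grad
    return new

# Three particles starting closer than the rest length.
pos = np.array([[0.0, 0.0], [0.3, 0.0], [0.0, 0.4]])
for _ in range(300):
    pos = step(pos)

# The learning dynamics relaxes all pair distances toward r0 = 1.0,
# i.e. the same equilibrium a physical force description would give.
print(round(float(np.linalg.norm(pos[0] - pos[1])), 2))
```

The duality claim is that such loss-driven updates reproduce the equations of motion; here the equilibrium (an equilateral triangle of side r0) matches what the equivalent pairwise force would produce.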
Covariant Gradient Descent
Guskov, Dmitry, Vanchurin, Vitaly
We present a manifestly covariant formulation of the gradient descent method, ensuring consistency across arbitrary coordinate systems and general curved trainable spaces. The optimization dynamics is defined using a covariant force vector and a covariant metric tensor, both computed from the first and second statistical moments of the gradients. These moments are estimated through time-averaging with an exponential weight function, which preserves linear computational complexity. We show that commonly used optimization methods such as RMSProp, Adam and AdaBelief correspond to special limits of the covariant gradient descent (CGD) and demonstrate how these methods can be further generalized and improved.
- North America > United States > Minnesota > St. Louis County > Duluth (0.04)
- North America > United States > Minnesota > Saint Louis County > Duluth (0.04)
- North America > United States > Florida > Broward County > Weston (0.04)
- (3 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.58)
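The reduction of covariant gradient descent to familiar optimizers can be sketched in the diagonal limit: estimate the force from the first moment of the gradients and a diagonal metric from the second moment, both via exponential time-averaging, then take the metric-preconditioned step. With these choices the update coincides with the well-known Adam rule; the test function and hyperparameters below are illustrative assumptions, and the paper's general formulation allows a full (non-diagonal) metric tensor.

```python
import numpy as np

def cgd_minimize(grad_fn, w0, lr=0.1, b1=0.9, b2=0.999, eps=1e-8, steps=500):
    w = np.asarray(w0, dtype=float)
    m = np.zeros_like(w)   # first moment: covariant force estimate
    v = np.zeros_like(w)   # second moment: diagonal metric estimate
    for t in range(1, steps + 1):
        g = grad_fn(w)
        # Exponential weight function keeps the moment estimates
        # linear-time in the number of parameters.
        m = b1 * m + (1 - b1) * g
        v = b2 * v + (1 - b2) * g * g
        m_hat = m / (1 - b1 ** t)   # bias correction
        v_hat = v / (1 - b2 ** t)
        # Metric-preconditioned step: in this diagonal limit,
        # exactly the Adam update.
        w = w - lr * m_hat / (np.sqrt(v_hat) + eps)
    return w

# Minimize the anisotropic quadratic f(w) = 0.5 * (100 w0^2 + w1^2):
# the metric rescales the two badly conditioned directions.
grad = lambda w: np.array([100.0 * w[0], w[1]])
w_star = cgd_minimize(grad, [1.0, 1.0])
print(np.round(w_star, 3))
```

Dropping the first-moment averaging (b1 = 0) recovers an RMSProp-like limit, matching the claim that these optimizers are special cases of the covariant scheme.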